BIOTECHNOLOGY gp-S- 



RAW SEQUENCE LISTING 
ERROR REPORT 




The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 

Application Serial Number: 01 In IS. 1411 

Source: Ol 
Date Processed by STIC: 




THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 



FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: patin21help(aiuspto.gov or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: patin3help@uspto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.0 PROGRAM , ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW: 



Checker Version 3.0 

The Checker Version 3.0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old 
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST.25. 

Checker Version 3.0 replaces the previous DOS-based version of Checker, and is Y2K- 
compUant. Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 
listings, thus saving time and money. 



Checker Version 3.0 can be down loaded from the USPTO website at the following address: 



http://w^ww.uspto.gov/web/offices/pac/checker 



RAW SEQUENCE LISTING DATE: 07/06/2001 

PATENT APPLICATION: US/09/715,417 TIME: 10:55:55 

Input Set : A:\Cural061.APP 

Output Set: N:\CRF3\07062001\I715417.raw 

3 <110> APPLICANT: Shimkets, Richard 

4 Lichenstein, Henri OlSKstte Needed 

5 Vernet, Corine COffeCteO ^ 

6 Fernandes , Elma 

8 <120> TITLE OF INVENTION: NOVEL POLYPEPTIDES AND NUCLEIC ACIDS ENCODING SAME 
10 <130> FILE REFERENCE: 15966-606 

12 <140> CURRENT APPLICATION NUMBER: 09/715,417 

13 <141> CURRENT FILING DATE: 2000-11-16 

15 <150> PRIOR APPLICATION NUMBER: 60/166,336 

16 <151> PRIOR FILING DATE: 1999-11-19 

18 <150> PRIOR APPLICATION NUMBER: 60/167,785 

19 <151> PRIOR FILING DATE: 1999-11-29 

21 <150> PRIOR APPLICATION NUMBER: 60/187,844 

22 <151> PRIOR FILING DATE: 2000-03-08 
24 <160> NUMBER OF SEQ ID NOS : 38 
26 <170> SOFTWARE: Patentin Ver. 2.1 




ERRORED SEQUENCES 

1717 <210> SEQ ID NO: 29 

1718 <211> LENGTH: 3905 

1719 <212> TYPE: DNA 

1720 <213> ORGANISM: Homo sapiens 

1722 <400> SEQUENCE: 29 

1723 ggatttgaga gcctgaactt agccatacac cagatctacc tttggaccgc aaaagggacc 60 

1724 cagtgcttca tgaagctggt tttttttgtt ttgttttgtt ttttttccgt tgttttgttt 120 

1725 cggctttacc aacctgactg ggtgtttttc aatatccacc attcagactt tcctcaacag 180 

1726 cagaggatgt ggcagtggca aagacaaggg gatgggggga gacgaaaggg aaaggggcct 240 

1727 gcatgaaaga ccatgtctgt cttcctgctg gtgccagttc cctgaacctc atcttgttgt 300 

1728 tcagcccctt actgcagcct gcccagggct ccactccatg gcttcatcct aggccagacc 360 

1729 agcacccagc ccgggggctc catccacttt ggctgcaacg ccggctaccg cctggtggga 420 
17 30 cacagcatgg ccatctgtac ccggcacccc cagggctacc acctgtggag cgaagccatc 4 80 

1731 cctctctgtc aagctctttc ctgtgggctt cctgaggccc ccaagaatgg aatggtgttt 540 

1732 ggcaaggagt acacagtggg aaccaaggcc gtgtacagct gcagtgaagg ctaccacctc 600 

1733 caggcaggcg ctgaggccac tgcagagtgt ctggacacag gcctatggag caaccgcaat 660 

1734 gtcccaccac agtgtgtccc tgtgacttgt cctgatgtca gtagcatcag cgtggagcat 720 

1735 ggccgatgga ggcttatctt tgagacacag tatcagttcc aggcccagct gatgctcatc 780 

1736 tgtgaccctg gctactacta tactggccaa agggtcatcc gctgtcaggc caatggcaaa 840 

1737 tggagcctcg gggactctac gcccacctgc cgaatcatct cctgtggaga gctcccgatt 900 

1738 ccccccaatg gccaccgcat cggaacactg tctgtctacg gggcaacagc catcttctcc 960 

1739 tgcaattccg gatacacact ggtgggctcc agggtgcgtg agtgcatggc caatgggctc 1020 

1740 tggagtggct ctgaagtccg ctgccttgct ggacactgtg ggactcctga gcccattgtc 1080 

1741 aacggacaca tcaatgggga gaactacagc taccggggca gtgtggtgta ccaatgcaat 1140 

1742 gctggcttcc gcctgatcgg catgtctgtg cgcatctgcc agcaggatca tcactggtcg 1200 

1743 ggcaagaccc ctttctgtgt gccaattacc tgtggacacc caggcaaccc tgtcaacggc 1260 

1744 ctcactcagg gtaaccagtt taacctcaac gatgtggtca agtttgtttg caaccctggg 1320 



file://C:\CRF3\Outhold\VsrI7 1 541 7.htm 



Page 2 of 4 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/715,417 



DATE: 07/06/2001 
TIME: 10:55:55 



Input Set : A:\Cural061.APP 

Output Set: N:\CRF3\07062001\l715417.raw 



1745 
1746 
1747 
E--> 1748 

1749 
1750 
1751 
1752 
1753 
1754 
1755 
1756 
1757 
1758 
1759 
1760 
1761 
1762 
1763 
1764 
1765 
1766 
1767 
1768 
1769 
1770 
1771 
1772 
1773 
1774 
1775 
1776 
1777 
1778 
1779 
1780 
1781 
1782 
1783 
1784 
1785 
1786 
1787 
E--> 1788 



tatatggctg agggggctgc 
ctgcccacct gcagaatcat 
caggtccacg ccagcggccc 
imaaccacgg cttctacctc 
tgggaccgtc cccgccccca 
cactcccaga tgtctggaga 
ggcaagcgta ctctggtggg 
ggctccctcc ctcactgctc 
gctcatggca tccgtttggg 
gaagctggcc acgtgctccg 



agcggctcgc 
gcccgagttg 
gaaggatact 
ggcagtgacc 
cttcggctgg 
tatatgatgg 
ggaaccaagc 
aaggtggtgg 
taccagctct 
ctgcctcagt 
gaggaccgag 
ctggtgggct 
agctgcatag 
cagaacaatt 
tacctgcttc 
ccacctgact 
ggggccctgg 
ccctcaaggg 
cgcccatctg 
caccgctcac 
gggcctatga 
ttgccaacag 
ctggaactta 
ctgtggagat 
cgtcagagtc 
ggcagttcgg 



agcctgagtg 
tgttcagtga 
acgccacagg 
ctgagtgcct 
gcaatgactt 
agtcacatag 
ccgtctgcaa 
ggtctgactt 
ccctgcccgc 
gtttccctgt 
gcttctccta 
ctccacgcag 
atccgaccct 
ctcagggcta 
agggctccac 
gtgtccccca 



taggtcccaa tgcctggcca gcgggcaatg.. 
caactgtaca gatcctggac accaagaaaa 
gcacaggttc agcttcggca ccactgtgtc 
ctgggcaacc ccagtgctca gctgccaggg 
gtgtctcttg gtgtcctgtg gccatccggg 
cagttatact gtgggagcag tggtgcggta 
aaacagcacc cgcatgtgtg ggctggatgg 
aggaaccagc gtgggagttt gcggtgaccc 
ggacagcttt gatccaggca ctgtgatgcg 
gggatcgtca gagcgcacct gtcaagccaa 
tggagtgatc tcttgtggga accctgggac 
tggcctggtt ttctccagct 
cctgctcagc cgtcactgct 
cgtcataaac tgtggtgacc 
caggtacaac aaaactgtga 
agtatctgtg ctgagctgca 
agctctcatg tgcaagccac 
catgtggggc tcaagtgtga 
ggtgttcacc tgtgagggaa 
gttctgcggg gatcctggtg 
caggtcatct gtctccttct 
gttttgccag tcagatggga 
gaccacgtgt gcggaccctg 
ccaggttgga agcacagtcc 
caccaggacc tgcctcccaa 



cctggcagag 
ccaagccttg 



caagaaagaa 
ctttatgaat 
ctccggagcc 
ctttcaaaga 



ctatcgtcta 
cggtcaatgg 
ctgggattcc 
catatcagtg 
ccaaggaccg 
ctccgctcat 
cttatgcctg 
atgggtcctg 
tcccgtcccg 
cctgccatcc 
catggagtgg 
gtgtgccaca 
tcttccgttg 
acctgacctg 
cgccaacgca 
ctcctgccag 
tggcagctgg 
catcaacact 
caagaattcc 



ccactgcagg cagccagaga 
atttgccctc catgggctac acgctcatta 
tggctccgag caccgcacct gcaaggcgga 
gtccggccca gtgggagacc 
attcctgggg atgtttttgc 
ataccagggg aagaagcagc cagccatgct cagagtgact 
caaggtcaat gccaccatga tcgaccacag tggcgtggag 
gattttcatc tcctactcca ggtgtaccag 
aagttcaaag atgatcactg ggctttagat 
accttcatct accaaggctc tgtcaagggc 
ctggacctca ggctgctgga gtcagacccc 
gccgccactt tgcttccaac agcagctcag tggcagccgc gatcctggtg 
ccctcattat tgcgggcttc gtgctctatc tctacaagca caggagaaga 
ctttcaatgg ctatgctggc cacgagaaca ccaatgttcg ggccacattt 
tgtacgaccg caacatccag cccacagaca tcatggccag cgaggcggag 
gcacagtgtg cacagcagta tagccacccg gcctggccgc tttttttgct 
ggtactccag cagccgccga agctggactg tactgctgcc atctcagctc 
ccctgcctga ttcccctgcc tcagcctgcc gagtgcctgc gattgcaggc 



gagtgacatg 
tagtgttcgt 
ttaccggtgc 
agatggcaca 
ctccccgcct 
cagctgcatc 
acactggact 
tgggatcccg 
cttcagctgt 
tggctcgtgg 
tccaagtaat 
tgagtgccgg 
tacctggaca 
agccaatggc 
tgtccctggc 
gacatggaat 
ccccaatggg 
cctggagggg 
gaccggagag 
tgggaggaga 
ccctctggtg 
cacccagccc 
gtttgggata 
tcaaaaaggc 
gagtggaacc 
tgccaacgtc 
gagggcttct 
acaggcaagc 
gcccgggagc 
ctgtggaaag 
ggcttccaag 
ctgcacttgg 
attacagggc 
ggccatgtct 
caaggctttg 
gagtccattg 
cctttcatcg 
cccaaagttc 
gagaacccaa 
ttcacagtca 
aggttgaact 
actgcaacct 
gcgcaccgcc 



1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3905 



A use of n and/or Xaa has been detected in the Sequence Listing, 
each sequence using n or Xaa. 
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7/6/01 



VERIFICATION SUMMARY DATE: 07/06/2001 

PATENT APPLICATION: US/69/715,417 TIME: 10:55:56 



Input Set : A:\Cural061.APP 

Output Set: N:\CRF3\07062001\I715417.raw 

L:301 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 7 

L:1748 M:340 E: (46) "n" or "Xaa" used: Feature required, for SEQ ID#:29 
M:340 Repeated in SeqNo=29 



L 


2240 


M 


341 


W 


(46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


33 


L 


2272 


M 


341 


W 


(46) 


"n" 


or 


"Xaa" 


used. 


for 


SEQ 


ID# 


34 


L 


2314 


M 


341 


w 


(46) 


"n" 


or 


"Xaa" 


used. 


for 


SEQ 


ID# 


35 


L 


2369 


M 


341 


w 


(46) 


"n" 


or 


"Xaa" 


used, 


for 


SEQ 


ID# 


36 


L 


2405 


M 


341 


w 


(46) 


"n" 


or 


"Xaa" 


used. 


for 


SEQ 


ID# 


37 


L 


2441 


M 


341 


w 


(46) 


"n" 


or 


"Xaa" 


used. 


for 


SEQ 


ID# 


38 
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Raw Sequence Listing Error Summary 



ERROR DETECTED SUGGESTED CORRECTION 



SERIAL NUMBER 



ATTN: NEW RULES CASES: PLEASE DISREGARD ENGLISH -ALPHA" HEADERS, WIHCH WERE INSERTED DY PTO SOFTWARE 

J Wrapped Nuclcics The numbcrAcxl al the end of each line 'Vrappcd" down to the next line. This may oceur ifyour file 

Wrapped Aminos was retrieved in a word processor after creating it. Please adjust your right margin to .3; this will 
, prevent "wrapping." 



_lnvalid Line Lcngtli Tlic ruJcs require that a line not exceed 72 characters in length. This includes uliilc spaces. 

Misaligned Amino The numbering under each 5** amino acid is misaligned. C>o not use tab codes between numbers; 
use space characters, instead. 



Numbering 
4 Non-ASCII 

5 Variable Length 



_Palcnan 2.0 
"bug" 



The suUnilted file was not saved in ASCII(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent submission is saved in ASCII texL 

Sequaicc(s) contain n*s or Xaa*s representing more than one residue. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable length and indicate in the <220>-<223> section that some may be missing. 

-A "bug" in Patentln version 2.0 has caused tlic <220>-<223> section to be missing from amino acid 

scqucnces(s) , Normally, Patentln would automatically generate this secUon from iJic 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent amino acid sequence. This applies to the mandatory <220>-<22J> sections for 
Artificial or Unkno\vn sequences. 



missing. If inlcnlional, please insert the following lines for each skipped sequence: 



7 Skipped Sequences Sequaice(s) 

(OLD RULES) (2) INFORMATION FOR SCQ ID NO:X: (insert SEQ [D NO where "X" is shoun) 

(i) SEQUENCE CIIARACTERJSTICS: (Do not insert any subheadings under Diis heading) 



(xi) SEQUENCE DESCRlPnON:SEQ ID NO:X: (insert SEQ ED NO where "X" is shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to include tJie skipped sequences. 

missing. If intentional, please insert the following lines for each skipped sequence. 



8 Skipped Sequences Sequcnee(s) 

(NEW RULES) <:2 10> sequence id number 
<400> sequence id number 
000 



Use of n's or Xaa's Use of n's and/or Xaa*s have been delected in the Sequence Listing. 

(NEW RULES) Per 1 .823 of Sequence Rules, use of <220>-<223> is MANDATORY if n's or Xaa' s are present. 

In <220> to <223> section, please explain location of n or Xaa, and wliich residue n or Xaa represents. 



10 



II 



^Invalid <2!3> 
Response 



Useof<220> 



Per 1.823 of Sequence Rules, the only valid <2I3> responses are: Unknown. Artificial Sequence, or 
scientific name (Genus/species). <220>-<223> section is required when <2I3> response is Unknown or 
is Artificial Sequence 



Sequcnce(s) 



missing tlie <220> "Feature" and associated numeric identifiers and responses. 



12 



PalaitIn 2.0 
""bug- 



Use of <220> to <:223> is MANDATORY if <213> "Organism" response is "Artificial Sequence" or 
"Unknown." Pleaise explain source of genetic material in <220> to <223> section. 
(See 'Tcdcral Register," 06/01/1998. Vol. 63, No. 104, pp. 29631-32) (Sec. 1.823 of Sequence Rules) 

Please do not use "Copy to Disk" function of Patentln version 2.0. This causes a corrupted file, 
resulting in missing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use "File Manager" or any other manual meaj\s to copy file to floppy disk. 



AMC - Biotechnology Systems Branch - 06/04/2001 



